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Abstract. Assume that for some a < 1 and for all nutural n a set F n of 
at most 2 an "forbidden" binary strings of length n is fixed. Then there 
exists an infinite binary sequence cj that does not have (long) forbidden 
substrings. 

We prove this combinatorial statement by translating it into a statement 
about Kolmogorov complexity and compare this proof with a combina- 
torial one based on Laslo Lovasz local lemma. 

Then we construct an almost periodic sequence with the same property 
(thus combines the results from [T] and [2]). 

Both the combinatorial proof and Kolmogorov complexity argument can 
be generalized to the multidimensional case. 



1 Forbidden strings 

Fix some positive constant a < 1. Assume that for each natural n a set F n 
of binary strings of length n is fixed. Assume that F n consists of at most 2 an 
strings. 

We look for an infinite binary sequence ui that does not contain forbidden 
substrings. 

Proposition 1. There exists an infinite binary sequence uj and a constant N 
such that for any n > N the sequence uj does not have a substring x of length n 
that belongs to F n . 

One may consider strings in F n as "forbidden" strings of length n; proposi- 
tion then says that there exists an infinite sequence without (sufficiently long) 
forbidden substrings. 

For example, we can forbid strings having low Kolmogorov complexity. Let 
F n be the set of all strings of length n whose complexity is less than an. Then 
^Fn does not exceed 2 an (there are at most 2 an programs of size less than an). 

Therefore Proposition Q] implies the following statement that was used in Q]: 

Proposition 2. For any a < 1 there exists a number N and an infinite 
binary sequence uj such that any its substring x of length greater than N has 
high complexity: 

K(x) > a\x\. 
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Here K(x) stands for Kolmogorov complexity of x (the length of the shortest 
program producing x, the definition is given in [3 ); it does not matter which 
version of Kolmogorov compexity (prefix, plain, etc.) we consider since the log- 
arithmic difference between them can be compensated by a small change in a. 
The notation |a;| means the length of string x. 

Our observation is that the reverse implication is true, i.e., Proposition [2] 
implies Proposition!]] It is easy to see if we consider a stronger version of Propo- 
sition [2] when K is replaced by a relativized version where A is an arbitrary 
oracle (an external procedure that can be called). Indeed, consider the set of all 
forbidden strings as an oracle. Then the relativized complexity of any string in 
F n does not exceed an + 0(1) since its ordinal number in the length-sorted list 
of all forbidden strings is at most J2k<n ^ ~ 0(2 an ). The constant 0(1) can 
be absorbed by a small change in a, and we get the statement of Proposition [TJ 

More interestingly, we can avoid relativization and derive Proposition [T] from 
(non-relativized) PropostionO It can be done as follows. 

First note that we may assume (without loss of generality) that a is rational. 
Assume that for some set F of forbidden strings the statement of Proposition Q] 
is false. Then for each c e IN there exists a set F° with the following properties: 

(a) F c consists of strings of length greater than c; 

(b) F c contains at most 2 ak strings of length k for any k; 

(c) any infinite binary string has at least one substring that belongs to F c . 

(Indeed, let F c be the set of all forbidden strings that have length more 
than c.) 

The statement (c) can be reformulated as follows: the family of open sets 
S x for all x € F c covers the set £1 of all binary sequences, where S x is a set 
of all sequences that have substring x. The standard compactness argument 
implies that F c can be replaced by its finite subset, so we assume without loss 
of generality that F c is finite. 

The properties (a), (b) and (c) are enumerable (for finite F c ): each S x is an 
enumerable union of intervals, so if the sets S x for x £ F c cover J?, this can 
be discovered at a finite step. (In fact, they are decidable, but this does not 
matter.) So the first set F c encountered in the enumeration (for a given c) is a 
computable function of c. 

Now we can construct a decidable set of forbidden strings that does not satisfy 
the statement of Proposition [1] Indeed, construct a sequence c\ < ci < C3 < . . . 
where c^+i is greater than the length of all strings in F Ci and take the union of 
all F Ci . We obtain the decidable set F such that F contains at most 2 ak strings 
of length k for any fc, and any infinite binary string has (for any i) at least one 
substring of length greater that a that belongs to F. For this decidable set we 
need no special oracle, q.e.d. 

The proof of Proposition [5] given in [I] uses prefix complexity. See below 
Section [3] where we prove the stronger version of this Proposition needed for our 
purposes. 
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2 Combinatorial proof 

The statement of Proposition [1] has nothing to do with Kolmogorov complexity. 
So it would be natural to look for a combinatorial proof. 

The simplest idea is to use the random bits as the elements of the sequence. 
Then the probability of running into a forbidden string in a given k positions 

is bounded by 2~( 1 ™") fc , i.e., exponentially decreases when k — > oo. However, 
the number of positions where a forbidden string of a given length can appear is 
infinite, and the sum of probablities is infinite too. And, indeed, a truly random 
sequence contains any string as its substring, so we need to use something else. 

Note that two non-overlapping fragments of a random sequence are inde- 
pendent. So the dependence can be localized and we can apply the following 
well-known statement: 

Proposition 3 (Laslo Lovasz local lemma). Let G be a graph with vertex 
set V = {v\, . . . , v n } and edge set E. Let Ai be some event associated with vertex 
Vi. Assume that for each i the event Ai is independent with the random variable 
"outcomes of all Aj such that Vj is not connected to Vi by an edge". Letpi S (0, 1) 
be a number associated with Ai in such a way that 

where the product is taken over all neighbour vertices Vj (connected to Vi by an 
edge). Then 

n 

Pi[neither of Ai happens] > TT(1 — Pi) 

i=l 

and, therefore, this event is non-empty. 

The proof of this Lemma could be found, e.g., in [4], p. 115. 

To apply this Lemma to our case consider a finite random string of some 
fixed length N where all bits are independent and unbiased (both outcomes 
have probability 1/2). Consider a graph whose vertices are intervals of indices 
(i.e., places where a substring is located) of length at least L (some constant to 
be chosen later). Two intervals are connected by an edge if they are not disjoint 
(share some bit). For each interval v consider the event A v : "substring of the 
random string located at v is forbidden". This event is independent with all 
events that deal with bits outside v, so the independence condition is fulfilled. 

Let p v = 2~ <5 I I 'I for all v and some S (to be chosen later). To apply the lemma, 
we need to prove that 

Pv[A v ]<p v Y[ (l-p w ). 

v and w are 
not disjoint 
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Let I > L be the length of the string v and let 

r= n c 1 -^)- 



v and tu are 
not disjoint 

Then 

N 

R> Y[ (1 - 2~ 5k ) l+k 

k=L 

(strings w have length between L and N and there are at most I + k strings of 
length k that share bits with v), and 



R > 



na -2- 



d ~ fc ) 
fc>£ 



na 2 



fc>L 



(we split the product in two parts and replace finite products by infinite ones). 
The product Y\(l — ei) converges if and only if the series Yl £ i converges. The 
corresponding series 

2 ~ Sk and J2 k ' T 

k>L k>L 



do converge. Therefore both products converge and for a large L both products 
are close to 1: 

R > C[C 2 > D l 

where C\ , C2 and D are some constants that could be made close to 1 by choosing 
a large enough L (not depending on I). Then 

p v R > 2~ sl D l > 2- sl 2^ 1 = 2- {s+ ^ 1 , 

where 7 = — log D could be arbitrarily small for some L. We choose S and L in 
such a way that 5 < (1 — a)/2 and 7 < (1 — a)/2. Then 

p v R > 2- {1 - a)l > Yt[A v ] 

(forbidden strings form a 2 _ ( 1_Q ' i -fraction of all strings having length I) and 
conditions of Lovasz lemma are fulfilled. 

So we see that for some large L and for all sufficiently large N there exists a 
string of length N that does not contain forbidden strings of length L or more. 
Standard compactness argument shows that there exists an infinite binary string 
with the same property. 

This finishes the combinatorial proof of Proposition [T] 

Note that this combinatorial proof hardly can be considered as a mere trans- 
lation of Kolmogorov complexity argument. Another reason to consider it as a 
different proof is that it has a straightforward generalization for several dimen- 
sions. (The Kolmogorov complexity argument has this too, as we see in Section[5l 
but requires significant changes.) 
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A G?-dimensional sequence is a function w. 7Zi d —¥ {0, 1}. Instead of substrings 
we consider d-dimensional "subcubes" in the sequence, i.e., restrictions of oj to 
some cube in 7Z d . For any n there are 2" different cubes with side n. Assume 
that for every n > 1 a set F n of not more than 2 an "forbidden cubes" is fixed. 

Proposition 4. There exists a number L and d- dimensional sequence that 
does not contain forbidden subcube with side greater than L. 

The proof repeats the combinatorial proof of Proposition[T]with the following 
changes. The bound for R now is 

N 

R> l[(l-2- 5kd Y l+ V d , 

k=L 

since there are at most (/ + k) d cubes with side k intersecting a given cube 
with side I. Then we represent (I + k) d as a sum of d + 1 monomials and get 
a representation of this bound as a product of infinite products, each for one 
monomial. Every product has the following form (for some i in ... d and for 
some Ci that depends on d and i, but not k and /): 



-&k d \c i l i k j 



k>L 



-I Cii 



Y[(i-2- sk Y 



k>L 



The corresponding series obviously converge (due to the same reasons as before), 
and again we can make expression [. . .] as close to 1 as needed by choosing L 
(and again the choice of L does not depend on I). Then the estimate for R takes 
the form: 

i d 



> D 



i=0 i=l U=l 

where Cj, Di, C and D are some constants, and C and D could be made as close 
to 1 as needed. 

Then the proof goes exactly as before. 



3 Construction of almost periodic sequences 

A sequence is called almost periodic if each of its substrings has infinitely many 
occurences at limited distances, i.e., for any substring x there exists a number k 
such that any substring y of lo of length k contains x. 

The following result is proven in [5] (in the paper almost periodic sequences 
were called strongly almost periodic sequences): 

Proposition 5. Let a < 1 be a constant. There exists an almost periodic 
sequence uj such that any sufficiently long prefix x of lo has large complexity: 
K(x) > a\x\. 
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Comparing this statement with Proposition [21 we see that there is an ad- 
ditional requirement for the sequence to be almost periodic; on the other hand 
high complexity is guaranteed only for prefixes (and not for all substrings). 

Now we combine these two results: 

Proposition 6. Let a < I be a constant. There exists an almost periodic 
sequence uj such that any sufficiently long substring x of uj has large complexity: 
K(x) > a\x\. 

The paper [2j provides a universal construction for almost periodic sequences. 
Now we suggest another, less general construction that is more suitable for our 
purposes. 

Namely, we define some equivalence relation on the set of indices (IN). Then 
we construct a sequence 

UJ = UJQUJ1UJ2 ■ ■ ■ 

with the following property: i = j =>• Wj = uy. In other words, all the places that 
belong to one equivalence class carry the same bit. This property guarantees 
that uj is almost periodic if the equivalence relation is chosen in a proper way. 

Let no, rii, Ti2, ■ ■ ■ be an increasing sequence of natural numbers such that 77^+1 
is a multiple of ni for each i. The prefix of length no, i.e., the interval [0,no), 
is repeated with period n\. This means that for any i such that < i < no the 
numbers 

i, i + ni, i + 2ni, i + 3ni, . . . 

belong to the same equivalence class. In the similar way the interval [0,ni) is 
repeated with period ni'- for any i such that < i < n2 the numbers 

i, i + n 2 , i + 2n 2 , i + 3n 2 , . . . 

are equivalent. (Note that n 2 is a multiple of ni, therefore the equivalence classes 
constructed at the first step are not changed.) And so on: for any i 6 [0, n s ) and 
for any k the numbers i and i + kn s+ i are equivalent. 



[joX^^ i j i j "o i„TT7r37j^j "n i j n « [f > 

Fig. 1. Primary (shaded) and secondary bits in a sequence 

The following statement is almost evident: 

Proposition 7. // a sequence uj respects this equivalence relation, i.e., the 
equivalent positions have equal bits, then the sequence in almost periodic. 
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Indeed, in the definition of an almost periodic sequence we may require that 
each prefix of the sequence has infinitely many occurences at limited distances 
(since each substring is a part of some prefix) . And this is guaranteed: any prefix 
of length I < n s appears with period n s +i- 

The same construction can be explained in a different way. Consider the 
positional system where the last digit of integer a; is a; mod no, the previous 
digit is (x div no) mod n\ etc. Then all numbers of the form . . . Oz (for any given 
z G [0,no)) are equivalent; we say that they have rank 1. Then we make (for 
any y, z such that y ^ 0) all numbers of the form . . . Oyz equivalent and assign 
rank 2 to them, etc. 

If the sequence of periods no < n\ < n 2 < ■ ■ ■ is growing fast enough, then 
the equivalence relation does not restrict significantly the freedom of bit choice: 
going from left to right, we see that most of the bits are "primary" bits (are 
leftmost bits in their equivalence class, not copies of previous bits; these copies 
are called "secondary" bits, see Fig. [1]). 

Indeed, bits of rank 1 start with no primary bits, these bits are repeated as 
secondary bits with period ni, so secondary bits of rank 1 form a no/ni-fraction 
of all bits in the sequence; secondary bits of rank 2 form a ni /^-fraction etc. 
So the sum ^-j- is the upper bound of the density of "non-fresh" bits. More 
precise estimate: prefix of any length N has at least DN fresh bits where 

D = JJ(1 - rii/m+i). 

i 

This gives a simple proof of Proposition^ For a given a choose a computable 
sequence n < n\ < n 2 < . . . that grows fast enough and has D > a. Then take 
a Martin-Ldf random sequence £ and place its bits (from left to right) at all 
free positions (duplicating bits as required by the equivalence relation). We get 
an almost periodic sequence uj; at least DN bits of £ can be algorithmically 
reconstructed from w's prefix of length N. It remains to note that algorithmic 
transformation cannot increase complexity and that complexity of m-bit prefix 
of a random sequence is at least m — o(m) (it would be at least m for monotone 
or prefix complexity, but could be O(logm) smaller for plain complexity). 

4 Proof of the main result 

Could we apply the same argument (with sequence u> from Proposition [2] instead 
of a random sequence) to prove Proposition [6]? Not directly. To explain the 
difficulty and the way to overcome it, consider the simplified picture where only 
the equivalence of rank 1 is used. Then the sequence constructed has the form 

w = A B Q A B x A B 2 A B 3 A . . . 

where A is the group of primary bits of rank 1 (repeated with period ni); A and 
Bi are taken from a sequence 



£, — A Bq B\ £?2 S3 • 
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(provided by Proposition [2J . If some substring x of uj is located entirely in A 
or some Bi, its high complexity is guaranteed by Proposition [2] However, if x 
appears on the boundary between A and Bi for some i > 0, then x is composed 
from two substrings of £ and its complexity is not guaranteed to be high. 

To overcome this difficulty, wc need the following stronger version of Propo- 
sition [2] 

Proposition 8. For any a < 1 there exists a number N and an infinite 
binary sequence lj such that any its substring 

X = LO n UJ n+ iUJ n+ 2 . . . W„+fc-l 

of length k > N has high conditional complexity with respect to previous bits: 
K(w„w„ + iw n+2 . . . Lu n+ k-i | wo^iw 2 • ■ • w„-i) > ak. 

The proof follows the scheme from [Tj. Let (3 < 1 be greater than a. Let m 
be some integer number (we will fix it later) . Let the first to bits of to be the 
sequence x of length to with maximal prefix complexity (denoted by KP). Then 
add the next m bits to get the maximal prefix complexity of the entire sequence. 
This increase would be at least m — O(logm). 

[Indeed, for any strings x and y we have 

KP(x, y) = KP(x) + KP(y \ x, KP(.x)) + 0(1); 

(Kolmogorov - Levin theorem); if y has been chosen to maximize the second 
term in the sum, then KP(y | . . .) > \y\ and KP(i,y) > KP(x) + \y\ — 0(1). 
Therefore, for this y 

KY{xy) > KP(x,2/)-KP(|y|)-0(l) > KP(x) + \y\ - 0(log \y\), 

since (x, y) can be reconstructed from xy and \y\ and KP(|y|) = 0(log \y\). See [1] 
for details.] 

Then wc add string z of length m that maximizes KP(xyz) and so on. 

In this way we construct a sequence u = xyz . . . such that the prefix com- 
plexity of its initial segments increases by to — c log m for every added block of 
to bits. We can choose to such that to — clog to — 0(1) > (3m. 

Then the statement of the Proposition follows from Kolmogorov - Levin 
theorem if the substring is "aligned" (starts and ends on the boundaries of 
length to blocks). Since to is fixed, the statement is true for non-aligned blocks of 
large enough length (boundary effects are compensated by the difference between 
a and ft). 

Proposition [8] is proven. 

Let us explain why this modification helps in the model situation considered 
above. If a substring x of the sequence ABqAB\AB^ ... is on the boundary 
between A and some Bi, then it can be split into two parts xa and xb- The 
string xa is a substring of A and therefore has high complexity. The string xb 
is a substring of some Bi and therefore also has high complexity and even high 
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conditional complexity with respect to some prefix containing A. If we prove 
that xa is simple relatively this prifix we can use Kolmogorov - Levin theorem 
to prove that x has high complexity. 

Similar arguments work in general case when we have to consider bits of all 
ranks. To finish the proof we need the following Lemma: 

Lemma. Let uj be the sequence satisfying the statement of Proposition [3 
Then 

K(7(a 0) b ), Viauh), 7(a s _x, 6 s _x)) > 

aL - 0(s\ogL) - K(ao | a\) - K(ai | a%) — ... — K(a s - 2 | a s _i) 

for any clq < b$ < ai < b\ < ... < a s _i < b s -\, where V(a,b) stands for 
uj a uj a+ i . . .u>b-i and L — (bo - a ) + (&x - ax) + . . . + (& s -i - a s _i). 

In fact, for Proposition |6] we need only the case s = 3 of this Lemma. 

The proof of Lemma is based on Kolmogorov - Levin theorem about complex- 
ity of pairs. The statement of Proposition [8] guarantees the following inequality: 

K(V(a s _i, | V(0,a s ^)) > a(b s ^ - a s ^) - 0(logL). (*) 

We will prove the following inequality of any i — 0, 1, . . . , s — 2: 

K(V(a i ,bi),V(a H .i,b i+1 ),...,V(a,-i,b.- 1 ) \ V(0,Oi))- 

K(V(a i+1 ,b i+1 ), . . . , K(o,_i, 6 a _x) | 7(0, a i+1 )) > (**) 
a(&i - - O(logL) - K(a, | a i+1 ). 

If we add up (**) for all i = 0, 1, . . . , s — 2 with (*) we obtain the required 
inequality (and even stronger one with relative complexity in the left-hand 
side). Let us prove the inequality (**) now. By W we denote the sequence 
(V(a,i+i, bi+x), . . . , V(a s -x, fr s -i))- The following inequality follows from the Kol- 
mogorov - Levin theorem and the statement of Proposition [S] 

K(y(ai,bi),W | V(0, a,)) - K(W \ V(0, a,), V(a h b t )) = 

K(V(oi, h) | V(0, ai )) - 0(log L) > a(h - a 4 ) - 0(log L). 

To finish the proof of Lemma, let us prove the inequality 

K(W | V(0,a i+ i)) <K(W | V(0,a l ),V(a l ,b l ))+K(a l \ a i+1 ) + O(logi). 

One can obtain W from 7(0, a^ + i) in the following way: find Oj+i using the length 
of the string 7(0, a^+i), convert dj+x m to ai by the shortest program, compute &i 
by adding difference bi — ai to ai, cut intervals [0,Oi) and [oi,6,) from string 
7(0, aj+x) and execute the shortest program that converts (7(0, Oj), 7(ai, 6,)) 
into 17. This needs K(I7 | 7(0, a,), V(a u h)) + K(a t \ a l+1 ) + O(logL) bits to 
obtain 17 from 7(0, fli+x)- The inequality is proven, q.e.d. 

The proof of Proposition [5] uses the same construction as proof of Proposi- 
tion [S] but it takes a sequence satisfying the statement of Proposition [5] instead 
of a random sequence. 

Let v be a sequence satisfying the statement of Proposition [8] with some 
a' > a and uj be the resulting sequence (if we apply the construction of an 
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almost periodic sequence to the sequence v). It has been proved before that uj is 
an almost periodic sequence. We need only to prove the following estimate of a 
complexity of any substring of to: 

K(uj m uj m+ iUj m+2 . . .io m+k -i) > ak. 

for any sufficiently long k and for any m. 

Suppose that sequence {rij} grows fast enough, i.e. Y^jLi ~7T i < ~ 2° - Sup- 
pose i is the smallest index such that n» > k. Due to our construction of se- 
quence uj any element of uj corresponds to some element of v. Different elements 
of uj m uj m+ \ . . . w m _|_fc_i of rank not less than i (i.e. elements repeated with pe- 
riod m or greater by our construction) correspond to different elements of v 
because the distance between elements of the given substring of uj is less than rn 
(and less than the period). It is easy to prove that in this substring the density 
of elements of small rank (less than i) is not greater than a' — a. 

Indeed, the number of elements of rank j on any interval of length rij is 
equal to rij-i and we can cover the given interval of length k with at most 
^7 + 1 intervals of length rij . Therefore the number of elements of rank j on the 

given interval is not greater than rij-i + lj ■ So the density of elements of 

rank less than i in the given substring is not greater than X^=i ( "n~ X + ^ < 
2^.~ 1 < a' — a due to our assumption about growing of {rij}. 

Hence the substring uj rn . . . uj m+ k-i corresponds to some intervals in v. Throw 
away all elements of small ranks from these intervals of v and denote the remain- 
ing intervals by [do, 6o), . . . , [a s _i, & s -i), where a < b < . . . < a s _! < 6 s _i. The 
number of these intervals is at most 3. Indeed, we can enumerate all elements of 
uj m . . .w m+ fc_! from left to right, not counting elements of small ranks, and for 
each element find the corresponding element of v. The index of corresponding 
clement will increase by 1 every time except when we cross a point of type riij 
or riij + rii-i (where j is integer). But there are at most 2 points of this type in 
the interval of length k so there are at most 3 corresponding intervals. 

Substrings V(ao, bo), . . . , V(a s ~i, b s -i) (defined as in Lemma) can be com- 
puted by an algorithm using the given substring of uj. The algorithm needs only 
to know the value of m mod for finding elements with small rank (less 

than i) and the relative positions of elements of oj m . . .oj m+ k-i corresponding 
to v aj and i^-i where j = 0,1, . . . , s — 1. Because s < 3 only a logarithmical 
amount of additional bits is needed. So we can prove the following inequality to 
finish the proof of Proposition [5J 

K(V(ao, b ), . . . , F(a s _ x , 6 s _x)) > ak - 0(log k). 

We can use Lemma for this because a'L > ak, where L = (bo — ao) + (bi — a±) + 
. . . + (6 s _i — a s -i) (we have already proved that in this substring the density of 
elements of small rank is not greater than a' — a, hence k — L < (a' — a)k). 

If we prove that K(aj | Oj + i) ~ O(logfc) we will finish the proof of the 
proposition. Suppose we know aj+i. We can find a,j in the following way. Find 
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the element of the given substring of ui corresponding to v aj+1 . Add to the index 
of the found element the difference between the indexes of the elements of the 
given substring corresponding to v aj and v aj+1 (this difference is not greater 
than the lenght of the given substring, i.e., we use only a logarithmical amount 
of memory) . We get an element of u) corresponding to v aj ■ It can be used to 
calculate dj. But the first step of this algorithm uses knowing the position of 
the given substring which needs an unlimited amount of memory. We can avoid 
using this position if we notice that the rank i of elements of ui corresponding 
to v aj is not greater than the rank I of elements of u> corresponding to v a . +1 
(because aj < dj+i). So rii is a multiple of Uj. Hence at the first step we can take 
any element of ui corrensponding to v a , +1 (for example, the first one). We get 
the same result since the elements corresponding to v aj repeat with period n, 
and the elements corresponding to v aj+1 repeat with period nj. 

Therefore we construct the algorithm proving that K(aj | Oj+i) = O(logfc), 
and so the proof of Proposition [5] is complete. 

Remarks. 

1. Proposition[6]implies the existence of a bi-infinite almost periodic sequence 
with complex substrings (using the standard compactness argument; this argu- 
ment can be even simplified for the special case of almost periodic sequences). 

2. The proof of Proposition [5] works for relativized version of complexity. 
Therefore we get (as explained above) the following (pure combinatorial) strong 
version of Proposition [TJ 

Corollary. Assume that for each n a set F n of forbidden substrings of 
length n is fixed, and the size of F n is at most 2 an . Then there exists an in- 
finite almost periodic binary sequence ui and a constant N such that for any 
n > N the sequence u) does not have a substring x that belongs to F n . 

5 Multidimensional case 

Similar but more delicate arguments could be applied to multidimensional case 
too. 

A (^-dimensional sequence lu : 2Z d — > {0, 1} is almost periodic if for any cube x 
that appears in ui there exists a number k such that any subcube with side k 
contains x inside. 

Proposition 9. Fix an integer d > 1. Let a be a positive number less 
than I. There exists an almost periodic d-dimensional sequence ui such that any 
sufficiently large subcube x of U) has large complexity: 

K(x) > a ■ volume{x) 

Here volume is the number of points, i.e., side d . 

In the multidimensional case the complexity argument needs Proposition [8] 
even if we do not insist that u> is almost periodic. 

Informally, the idea of the proof can be explained as follows. Consider, for 
example, the case d = 2. Take a sequence v from Proposition [8] and write down 
its terms along a spiral. 
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Then we need to bound the complexity of a cube (i.e., square). This square 
contains several substrings of the sequence v. (Unlike the previous case where 
only 3 substrings were needed, now the number of substrings is proportional to 
the side of the square.) Then we apply the Lemma to these substrings to get the 
bound for the complexity of the entire square. 

This works if we do not require uj to be almost periodic (so the argument 
above could replace the combinatorial proof using Lovasz lemma). It needs 
additional modifications to get the almost periodic sequence. Similar to one- 
dimensional construction, the cube [— no,na) d is duplicated periodically in all 
directions with shifts being multiples of n\ (where no | ni); the cube [— ni,ni) rf 
is duplicated with shifts being multiples of ni (where ri\ | 712), etc. 

As in one-dimensional case, it is easy to see that this construction guarantees 
that uj is almost periodic. Let v be a sequence satisfying the statement of Propo- 
sition [8] with some a 1 > a. We sort all new positions of oj by rank (the element 
has rank j if it is duplicated with period rij by the structure described) then by 
coordinated in lexicographical order. Then we fill the positions with the elements 
of v in this order. Let B = [mi, m\ + k) x [rn 2 , mi + k) x . . . x [md, md + k) is a 
cube. We need only to prove that cube B in the sequence uj has high comlexity: 

K(ui B ) > ak d . 

Suppose that sequence {rij} grows fast enough, i.e. J2JLi < ~ 4 ~ - Sup- 
pose i is the smallest index such that n,; > k. Due to our construction of se- 
quence uj any element of uj corresponds to some element of v. Different elements 
of uj of rank not less than i in cube B correspond to different elements of v 
because the distance between elements of the given cube is less than (and less 
than the period). It is easy to prove that in this cube the density of elements of 
small rank (less than i) is not greater than a' — a. 

Indeed, the number of elements of rank j on any vertical (i.e., parallel to the 
last axis) interval of length rij is zero or 2rij-i and we can cover the given cube 
of side k with at most k d ~ 1 (^- + 1) vertical intervals of length rij. Therefore the 

number of elements of rank j on cube B is not greater than 2nj-\k d ^ 1 (j^- + lj . 
So the density of elements of rank less than i in the given cube is not greater 

than 2 X)J=i \ ~^~ + ^aT") — ^J2]=a ~jr J ' < a> ~ a ^ ue to our assumption 
about growing of {rij}. 



13 




Fig. 2. Duplicated cubes in two dimentional case. 



Hence cube B corresponds to some intervals in v. Throw away all elements 
of small ranks from these intervals of v and denote the remaining intervals by 
[do, bo), . . . , [a s _i, b s -i), where ao < 60 < . . . < a s _i < b s -\. The number 
of these intervals is at most 4k . Indeed, we can enumerate all elements of 
each vertical interval of length k in our cube from bottom to top (from small 
last coordinate to big one), not counting elements of small ranks, and for each 
element find the corresponding element of v. The index of corresponding element 
will increase by 1 every time except when we cross a point of type n,-j, riij+rii-i 
or rnj — rii-i (where j is integer). But there are at most 3 points of this type in 
any vertical interval of length k so there are at most 4 corresponding intervals for 
each vertical interval. But the number of vertical intervals of length k in cube B 
is equal to fc d_1 , so the total number of corresponding intervals s < 4A: d_1 . 

Substrings V(ao, 60), . . . , V(a s ^i, b s —i) (defined as in Lemma) can be com- 
puted by an algorithm using the given substring of u>. The algorithm needs only 
to know the value of rrij mod rij_i, where j — 1,2, ... ,d, for finding elements 
with small rank (less than i) and the relative positions in the cube B correspond- 
ing to v a . and Vb s -i where j = 0, 1, . . . , s — 1. Because s < 4/c d-1 the algorithm 
needs only 0(k d ~ 1 logfc) bits. So we can prove the following inequality to finish 
the proof of Proposition [SJ 

K(V(a , bo), Via.-ub.-i)) > ak d - O^- 1 log k) 

(the value 0(k d ~ 1 log A:) is compensated by a small change of a). We can use 
Lemma for this because a'L > ak, where L = (bo — ao) + (61 — 01) + . . . + (b s -i — 
Os-i) (we have already proved that in this cube the density of elements of small 
rank is not greater than a' — a, hence k — L < (a' — a)k). 

If we prove that K(cy | dj+i) = O(logfc) we will finish the proof of the 
proposition. Suppose we know dj+i. We can find dj in the following way. Find 
some element of ui corresponding to v aj+1 (for example, the smallest one). Add 
to the index of the found element the difference between the positions in the 
given cube corresponding to v aj and v aj+1 (this difference is not greater than 
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the side of the cube, i.e., we use only a logarithmical amount of memory). We 
get an element of uj corresponding to v aj . It can be used to calculate aj . This 
can be proven the same way as in Proposition |6] If at the first step we found the 
element in cube B corresponding to v a +1 we obviously would get v a as a result. 
Notice that the rank i of elements of uj corresponding to v aj is not greater than 
the rank / of elements of uj corresponding to v aj+1 (because Oj < aj+i). So ni is 
a multiple of n., and the result does not depend on the element corresponding 
to v aj+1 since the elements corresponding to v a . repeat with period rii and the 
elements corresponding to v aj+1 repeat with period nj. 

Therefore we construct the algorithm proving that K(aj | aj+i) = O(logfc), 
and so the proof of Proposition [9] is complete. 

6 Remarks 

Kolmogorov complexity is often used in combinatorial constructions as the re- 
placement of counting arguments. (Instead of proving that the total number 
of objects is larger that the number of "bad" objects we prove that an object 
of maximal complexity is "good".) Sometimes people even say that the use of 
Kolmogorov complexity is just a simple reformulation that often hides the com- 
binatorial essence of the argument. 

In our opinion this is not always true. Even without the almost periodicity 
requirement the two natural proofs of Proposition!!] (using complexity argument 
and Lovasz lemma) are quite different. The proof of Proposition [2] uses prefix 
complexity and cannot be directly translated into a counting argument. On the 
other hand, the use of Lovasz lemma in a combinatorial proof cannot be easily 
reformulated in terms of Kolmogorov complexity. (Moreover, for almost periodic 
case we don't know how to apply Lovasz lemma argument and complexity proof 
remains the only one known to us.) 
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